Netflix Genie with Tom Gianos | Software Engineering Daily
Update: 2017-07-12
Description
http://traffic.libsyn.com/sedaily/genie_tomgianos.mp3Podcast: Play in new window | Download
“Sometimes there’s a misconception that Genie is a job scheduling platform… Genie really represents our extraction layer, from what our computational resources are, to our end user jobs.”
Genie is an open-source tool that provides job and resource management for the Hadoop ecosystem in the cloud.
Tom Gianos is a Senior Software Engineer at Netflix focusing on its big data platform. He is one of the core contributors in charge of maintaining and improving Genie.
Questions
How fast is the pace of Netflix data collection increasing?
What are the differences between running Hadoop in the cloud and running it in a data center?
Why is job and resource management so important for a Hadoop ecosystem?
What does Netflix do with spare computational resources?
What is a bonus cluster?
Does Netflix collect more data than it can practically make use of?
Links
Genie
Yarn – Resource Scheduling Layer
Hadoop Platform as a Service in the Cloud
SE Daily Episode on Presto
Spring Boot
Share this:Click to share on Twitter (Opens in new window)Click to share on Facebook (Opens in new window)Click to share on LinkedIn (Opens in new window)Click to email this to a friend (Opens in new window)
Related
https://softwareengineeringdaily.com/2015/10/13/netflix-genie-with-tom-gianos/
“Sometimes there’s a misconception that Genie is a job scheduling platform… Genie really represents our extraction layer, from what our computational resources are, to our end user jobs.”
Genie is an open-source tool that provides job and resource management for the Hadoop ecosystem in the cloud.
Tom Gianos is a Senior Software Engineer at Netflix focusing on its big data platform. He is one of the core contributors in charge of maintaining and improving Genie.
Questions
How fast is the pace of Netflix data collection increasing?
What are the differences between running Hadoop in the cloud and running it in a data center?
Why is job and resource management so important for a Hadoop ecosystem?
What does Netflix do with spare computational resources?
What is a bonus cluster?
Does Netflix collect more data than it can practically make use of?
Links
Genie
Yarn – Resource Scheduling Layer
Hadoop Platform as a Service in the Cloud
SE Daily Episode on Presto
Spring Boot
Share this:Click to share on Twitter (Opens in new window)Click to share on Facebook (Opens in new window)Click to share on LinkedIn (Opens in new window)Click to email this to a friend (Opens in new window)
Related
https://softwareengineeringdaily.com/2015/10/13/netflix-genie-with-tom-gianos/
Comments
In Channel




